GenCore version 5.1.8

Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model

Run on:         May 31, 2006, 10:18:03 ; Search time 17609 Seconds
                                         (without alignments)
                                         11497.390 Million cell updates/sec

Title:          US-10-501-930-2
Perfect score:  3166
Sequence:       1 cggagcggcgagtcggtgcc gctattagcaaaaaaaaaaa 3166

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       6366136 seqs, 31973710525 residues

Total number of hits satisfying chosen parameters:      12732272

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :       GenEmbl:*
                 1:  gb_env:*
                 2:  gb_pat:*
                 3:  gb_ph:*
                 4:  gb_pl:*
                 5:  gb_pr:*
                 6:  gb_ro:*
                 7:  gb_sts:*
                 8:  gb_sy:*
                 9:  gb_un:*
                 10:  gb_vi:*
                 11:  gb_ov:*
                 12:  gb_htg:*
                 13:  gb_in:*
                 14:  gb_om:*
                 15:  gb_ba:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                   %
Result           Query
   No.    Score  Match  Length  DB  ID             Description
     1     3166  100.0    3166   6  AB059554       AB059554 Mus muscu
  c  2     2361   74.6  110000  12  BX322642_0     BX322642 Mus muscu
     3     2361   74.6  110000  12  BX322642_2     Continuation (3 of
  c  4     2361   74.6  110000  12  BX322642_2     Continuation (3 of
  c  5     2361   74.6  202001   6  AL772303       AL772303 Mouse DNA
     6   1408.4   44.5  236283  12  AC120773       AC120773 Rattus no
  c  7   1408.4   44.5  264245  12  AC096804       AC096804 Rattus no

ALIGNMENTS



RESULT 1
AB059554
LOCUS       AB059554                3166 bp    mRNA    linear   ROD 27-APR-2005
DEFINITION  Mus musculus ST8Sia VI mRNA for alpha 2,8-sialyltransferase,
            complete cds.
ACCESSION   AB059554
VERSION     AB059554.1  GI:21668466
KEYWORDS
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia;
            Sciurognathi; Muroidea; Muridae; Murinae; Mus.
REFERENCE   1
  AUTHORS   Takashima,S., Ishida,H.K., Inazu,T., Ando,T., Ishida,H., Kiso,M.,
            Tsuji,S. and Tsujimoto,M.
  TITLE     Molecular cloning and expression of a sixth type of alpha
            2,8-sialyltransferase (ST8Sia VI) that sialylates O-glycans
  JOURNAL   J. Biol. Chem. 277 (21), 24030-24038 (2002)
   PUBMED   11980897
REFERENCE   2  (bases 1 to 3166)
  AUTHORS   Takashima,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (07-APR-2001) Shou Takashima, RIKEN, Cellular
            Biochemistry Laboratory; 2-1 Hirosawa, Wako, Saitama, 351-0198,
            Japan (E-mail:staka@riken.jp, Tel:81-48-462-1111(ex.3424),
            Fax:81-48-462-4670)
FEATURES             Location/Qualifiers
     source          1..3166
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /db_xref="taxon:10090"
                     /country="Japan"
     gene            1..3166
                     /gene="ST8Sia VI"
     CDS             77..1273
                     /gene="ST8Sia VI"
                     /codon_start=1
                     /product="alpha 2,8-sialyltransferase"
                     /protein_id="BAC01265.1"
                     /db_xref="GI:21668467"
                     /translation="MRSGGTLFALIGSLMLLLLLRMLWCPADAPARSRLLMEGSREDT
                     SGTSAALKTLWSPTTPVPRTRNSTYLDEKTTQITEKCKDLQYSLNSLSNKTRRYSEDD
                     YLQTITNIQRCPWNRQAEEYDNFRAKLASCCDAIQDFVVSQNNTPVGTNMSYEVESKK
                     HIPIRENIFHMFPVSQPFVDYPYNQCAVVGNGGILNKSLCGAEIDKSDFVFRCNLPPI
                     TGSASKDVGSKTNLVTVNPSIITLKYQNLKEKKAQFLEDISTYGDAFLLLPAFSYRAN
                     TGISFKVYQTLKESKMRQKVLFFHPRYLRHLALFWRTKGVTAYRLSTGLMIASVAVEL
                     CENVKLYGFWPFSKTIEDTPLSHHYYDNMLPKHGFHQMPKEYSQMLQLHMRGILKLQF
                     SKCETA"
ORIGIN

  Query Match             100.0%;  Score 3166;  DB 6;  Length 3166;
  Best Local Similarity   100.0%;  Pred. No. 0;
  Matches 3166;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;



Qy      1
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1

Qy     61 ACGCCCGTGGCTCAGGATGAGATCGGGGGGCACGCTGTTCGCCCTCATAGGCAGCCTGAT 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 ACGCCCGTGGCTCAGGATGAGATCGGGGGGCACGCTGTTCGCCCTCATAGGCAGCCTGAT 120

Qy    121 GCTGCTGCTCCTCCTGCGTATGCTCTGGTGCCCAGCCGACGCGCCTGCCCGCTCCAGGCT 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 GCTGCTGCTCCTCCTGCGTATGCTCTGGTGCCCAGCCGACGCGCCTGCCCGCTCCAGGCT 180

Qy    181 GTTGATGGAGGGAAGCAGAGAGGACACCAGTGGTACCTCAGCTGCACTGAAGACACTCTG 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 GTTGATGGAGGGAAGCAGAGAGGACACCAGTGGTACCTCAGCTGCACTGAAGACACTCTG 240

Qy    241 GAGCCCGACAACCCCGGTACCACGCACCAGGAACAGCACATATCTGGATGAGAAGACAAC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 GAGCCCGACAACCCCGGTACCACGCACCAGGAACAGCACATATCTGGATGAGAAGACAAC 300

Qy    301 CCAAATAACAGAGAAATGCAAAGATCTGCAATATAGCTTGAACTCTTTATCTAACAAAAC 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 CCAAATAACAGAGAAATGCAAAGATCTGCAATATAGCTTGAACTCTTTATCTAACAAAAC 360

Qy    361 GAGACGGTACTCTGAGGATGACTACCTCCAGACCATCACAAACATACAGAGATGCCCATG 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 GAGACGGTACTCTGAGGATGACTACCTCCAGACCATCACAAACATACAGAGATGCCCATG 420

Qy    421 GAACCGGCAAGCAGAAGAATATGACAATTTTAGAGCAAAACTGGCTTCCTGTTGCGATGC 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 GAACCGGCAAGCAGAAGAATATGACAATTTTAGAGCAAAACTGGCTTCCTGTTGCGATGC 480

Qy    481 CATTCAAGACTTCGTGGTTTCCCAGAACAACACTCCAGTGGGGACTAACATGAGCTACGA 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 CATTCAAGACTTCGTGGTTTCCCAGAACAACACTCCAGTGGGGACTAACATGAGCTACGA 540

Qy    541 GGTGGAAAGCAAGAAACACATCCCCATTCGAGAGAACATTTTCCACATGTTTCCAGTGTC 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 GGTGGAAAGCAAGAAACACATCCCCATTCGAGAGAACATTTTCCACATGTTTCCAGTGTC 600

Qy    601 GCAGCCTTTTGTGGACTATCCCTATAACCAGTGTGCAGTGGTTGGTAATGGGGGAATTCT 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 GCAGCCTTTTGTGGACTATCCCTATAACCAGTGTGCAGTGGTTGGTAATGGGGGAATTCT 660

Qy    661 CAACAAGTCTCTCTGCGGAGCAGAAATTGATAAATCTGACTTCGTCTTCAGGTGTAACCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 CAACAAGTCTCTCTGCGGAGCAGAAATTGATAAATCTGACTTCGTCTTCAGGTGTAACCT 720

Qy    721 CCCCCCAATCACAGGGAGCGCTAGTAAAGATGTTGGAAGCAAAACAAATCTTGTGACTGT 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 CCCCCCAATCACAGGGAGCGCTAGTAAAGATGTTGGAAGCAAAACAAATCTTGTGACTGT 780

Qy    781 CAATCCCAGCATTATAACCCTGAAGTACCAGAATTTGAAGGAGAAGAAAGCACAGTTTTT 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 CAATCCCAGCATTATAACCCTGAAGTACCAGAATTTGAAGGAGAAGAAAGCACAGTTTTT 840

Qy    841 GGAGGACATCTCCACCTATGGAGATGCATTCCTCCTCCTGCCAGCATTTTCCTATCGGGC 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 GGAGGACATCTCCACCTATGGAGATGCATTCCTCCTCCTGCCAGCATTTTCCTATCGGGC 900

Qy    901 CAACACAGGCATCTCTTTTAAAGTCTACCAAACACTCAAAGAGTCAAAAATGAGGCAAAA 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 CAACACAGGCATCTCTTTTAAAGTCTACCAAACACTCAAAGAGTCAAAAATGAGGCAAAA 960

Qy    961 GGTTCTCTTCTTCCATCCCAGGTACCTGAGACACCTCGCTCTTTTCTGGAGAACTAAAGG 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 GGTTCTCTTCTTCCATCCCAGGTACCTGAGACACCTCGCTCTTTTCTGGAGAACTAAAGG 1020

Qy   1021 GGTGACTGCATACCGCTTGTCCACAGGCTTGATGATTGCAAGTGTCGCTGTGGAACTGTG 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 GGTGACTGCATACCGCTTGTCCACAGGCTTGATGATTGCAAGTGTCGCTGTGGAACTGTG 1080

Qy   1081 TGAAAACGTGAAGCTCTACGGATTCTGGCCTTTCTCTAAGACTATCGAAGACACCCCACT 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 TGAAAACGTGAAGCTCTACGGATTCTGGCCTTTCTCTAAGACTATCGAAGACACCCCACT 1140

Qy   1141 CAGTCACCACTACTATGATAACATGTTACCTAAGCATGGTTTCCACCAGATGCCTAAAGA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 CAGTCACCACTACTATGATAACATGTTACCTAAGCATGGTTTCCACCAGATGCCTAAAGA 1200

Qy   1201 ATACAGCCAAATGCTCCAGCTCCATATGAGAGGAATCCTCAAACTGCAATTCAGCAAATG 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 ATACAGCCAAATGCTCCAGCTCCATATGAGAGGAATCCTCAAACTGCAATTCAGCAAATG 1260

Qy   1261 TGAAACGGCTTAACGTTTCTTAGAAGGAGAATAATTTCAGGAGGTGGAGTGGATGTGTCA 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1261 TGAAACGGCTTAACGTTTCTTAGAAGGAGAATAATTTCAGGAGGTGGAGTGGATGTGTCA 1320

Qy   1321 CAGCATCTCCAAAAAGCCAATAGAAGAAGGCACAGAGAAAGCATGAATTACAAAGGCGCT 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1321 CAGCATCTCCAAAAAGCCAATAGAAGAAGGCACAGAGAAAGCATGAATTACAAAGGCGCT 1380

Qy   1381 CTCCCACTTGTCTAGACCAAAGCCACCCGCCCCCACTCACTTTGCAGCCTCCACGAGTCA 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1381 CTCCCACTTGTCTAGACCAAAGCCACCCGCCCCCACTCACTTTGCAGCCTCCACGAGTCA 1440

Qy   1441 CTCATTCTCACCTTCAACGTTCTTTCTCTGAGAATAGAGACCAAAACATCAGACTTGGAT 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1441 CTCATTCTCACCTTCAACGTTCTTTCTCTGAGAATAGAGACCAAAACATCAGACTTGGAT 1500

Qy   1501 AAGTAAAATGAGATAATTTTTCAAATCATCATAGAATTTGATTTGAGCCAGGGTCTCTCA 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1501 AAGTAAAATGAGATAATTTTTCAAATCATCATAGAATTTGATTTGAGCCAGGGTCTCTCA 1560

Qy   1561 GAATGCTTCCTTGTTCCTATCCATGATAGCCATTCCCACCTTTATCAGAGTGGTAATGAA 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1561 GAATGCTTCCTTGTTCCTATCCATGATAGCCATTCCCACCTTTATCAGAGTGGTAATGAA 1620

Qy   1621 ACTGTGCAATTGTGCCAAAGACCCTTTCTGAAGAGAATGTCTGAATCATGCGCCGAGTTT 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1621 ACTGTGCAATTGTGCCAAAGACCCTTTCTGAAGAGAATGTCTGAATCATGCGCCGAGTTT 1680

Qy   1681 TTACACACAGCTCTTCCTTTATAAATAAATCCTTCCCATTCTCCCTCCTAGTAGAGTACA 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1681 TTACACACAGCTCTTCCTTTATAAATAAATCCTTCCCATTCTCCCTCCTAGTAGAGTACA 1740

Qy   1741 GAAACAAAATACCCTTGATGATTCAGGAAGAAAAGTCTTTTTTACTTAGCAATGTGCCTG 1800
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1741 GAAACAAAATACCCTTGATGATTCAGGAAGAAAAGTCTTTTTTACTTAGCAATGTGCCTG 1800

Qy   1801 CTTCTGATTCAGTTCGCTTGTGACATTAAGCTGGGTTGGGGTTTTGGTTGGATTTGGGGC 1860
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1801 CTTCTGATTCAGTTCGCTTGTGACATTAAGCTGGGTTGGGGTTTTGGTTGGATTTGGGGC 1860

Qy   1861 GTTTCTTCACTTCTTTTGTCTATATTTTCCTTACCTTTATCAGTTTGTATTCGAGCTTCC 1920
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1861 GTTTCTTCACTTCTTTTGTCTATATTTTCCTTACCTTTATCAGTTTGTATTCGAGCTTCC 1920

Qy   1921 TGCTTTGGGATTCTGCAATTCTCTCTCCCACTGACAGGATCAACTCAATGACATAAAGTA 1980
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1921 TGCTTTGGGATTCTGCAATTCTCTCTCCCACTGACAGGATCAACTCAATGACATAAAGTA 1980

Qy   1981 GTTCAAACATCCATTGCTTCTCACATGTTTTATCCATAAAGTTACTCATCTGATTTTATT 2040
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1981 GTTCAAACATCCATTGCTTCTCACATGTTTTATCCATAAAGTTACTCATCTGATTTTATT 2040

Qy   2041 TAAAATAGTGAACATCTACTTGATATCAGACCCGAGGACCATCCTCCATTGGAGAATATG 2100
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2041 TAAAATAGTGAACATCTACTTGATATCAGACCCGAGGACCATCCTCCATTGGAGAATATG 2100

Qy   2101 AAGATATTGTCACTGGCAGAAAAGCAGGTGTGTGCCATTAATTGATAAGATACCACAAGC 2160
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2101 AAGATATTGTCACTGGCAGAAAAGCAGGTGTGTGCCATTAATTGATAAGATACCACAAGC 2160

Qy   2161 ATCATCATGCCAGTTATGAACACAGTGCTGAAAGGATCATAGACAGGGGTGGTTAAATCT 2220
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2161 ATCATCATGCCAGTTATGAACACAGTGCTGAAAGGATCATAGACAGGGGTGGTTAAATCT 2220

Qy   2221 GATCCCAGTAGAATAAACTTCAGTGTACCTATTTCAGGGAAGAGTTAATTTCACAATTAA 2280
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2221 GATCCCAGTAGAATAAACTTCAGTGTACCTATTTCAGGGAAGAGTTAATTTCACAATTAA 2280

Qy   2281 AACTAGTAAATGAACCAATTCTTAGGCACATTAAGTGGATTCTGAGTAAAAGAAAGGGAA 2340
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2281 AACTAGTAAATGAACCAATTCTTAGGCACATTAAGTGGATTCTGAGTAAAAGAAAGGGAA 2340

Qy   2341 CAGCAGGAGAAAGCTGTTCGCTTGGTTCTGATTACCCAAATGAGCATGCTGGAAGGAGGT 2400
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2341 CAGCAGGAGAAAGCTGTTCGCTTGGTTCTGATTACCCAAATGAGCATGCTGGAAGGAGGT 2400

Qy   2401 TGTGAGGCTACGCTAAAACCTCTGCGTAGGGAGAGAGTACAGTGCATGAGTGTGGCGGCT 2460
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2401 TGTGAGGCTACGCTAAAACCTCTGCGTAGGGAGAGAGTACAGTGCATGAGTGTGGCGGCT 2460

Qy   2461 TTTGTCCACACTCGTGAAGGGTGAGTAATTCAGAGCCAATCACATCACAAGGATGGACAC 2520
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2461 TTTGTCCACACTCGTGAAGGGTGAGTAATTCAGAGCCAATCACATCACAAGGATGGACAC 2520

Qy   2521 ACCTAACTCATCACTTCAGGGGGAGATGAATGCTTTCATGAGAAATTACACTCATAAGCT 2580
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2521 ACCTAACTCATCACTTCAGGGGGAGATGAATGCTTTCATGAGAAATTACACTCATAAGCT 2580

Qy   2581 AAGCATCAGTTTTGAGTAAAATTTGAGTAGATGTTAAATATGAACATTTTATACCTCTTA 2640
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2581 AAGCATCAGTTTTGAGTAAAATTTGAGTAGATGTTAAATATGAACATTTTATACCTCTTA 2640

Qy   2641 CTAATGTCCCACCGACACCTTTTAATGTAAGCACATTTATTTATTAAGTTACTTGACATT 2700
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2641 CTAATGTCCCACCGACACCTTTTAATGTAAGCACATTTATTTATTAAGTTACTTGACATT 2700

Qy   2701 AAATGCTTATGTCTGTATATTCTGTTCATCCATCGATTTTCCCAAAAAGTAAGAGCATAG 2760
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2701 AAATGCTTATGTCTGTATATTCTGTTCATCCATCGATTTTCCCAAAAAGTAAGAGCATAG 2760

Qy   2761 GAGATGAGGCCTACATGCCAAGAAAACTATAAATTTTACTCTTTAATTCTTACTTGAGCC 2820
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2761 GAGATGAGGCCTACATGCCAAGAAAACTATAAATTTTACTCTTTAATTCTTACTTGAGCC 2820

Qy   2821 AGCTTGTTGTTTATCAAGTGCTTTTTTGAAGAGACAGCACCCTGTGAATTCTTCATTCTG 2880
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2821 AGCTTGTTGTTTATCAAGTGCTTTTTTGAAGAGACAGCACCCTGTGAATTCTTCATTCTG 2880

Qy   2881 ATACAGTGTCACCTTGTATTTAACATTTGTAATGTTGTTTCAAGTTTACATCTCTTTCAT 2940
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2881 ATACAGTGTCACCTTGTATTTAACATTTGTAATGTTGTTTCAAGTTTACATCTCTTTCAT 2940

Qy   2941 TCTTTTATAGCAAATCAAACGTATTAGCTTCAGAAATTTATCAGAAGTTCATATATAAAT 3000
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2941 TCTTTTATAGCAAATCAAACGTATTAGCTTCAGAAATTTATCAGAAGTTCATATATAAAT 3000

Qy   3001 ATTTTGCAAAGGGTAAAAGGCTTTTTTGTTAAATAAAATAAAATTTATTATTTTCTTCTG 3060
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3001 ATTTTGCAAAGGGTAAAAGGCTTTTTTGTTAAATAAAATAAAATTTATTATTTTCTTCTG 3060

Qy   3061 ATGAATAGAGGCTCTTTTATGCTGCTGCTAATGAACCTAATTAGCTTTAAATTATCTCCT 3120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3061 ATGAATAGAGGCTCTTTTATGCTGCTGCTAATGAACCTAATTAGCTTTAAATTATCTCCT 3120

Qy   3121 AGCAACATTGGTCACGTTTCAATCATGCTATTAGCAAAAAAAAAAA 3166
          ||||||||||||||||||||||||||||||||||||||||||||||
Db   3121 AGCAACATTGGTCACGTTTCAATCATGCTATTAGCAAAAAAAAAAA 3166
This page gives you Search Results detail for the Application 10501930 and Search Result us-10-501-930-1.rup.

GenCore version 5.1.8 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model

Run on:         May 26, 2006, 23:22:24 ; Search time 302 Seconds
                                         (without alignments)
                                         1219.060 Million cell updates/sec

Title:          US-10-501-930-1
Perfect score:  2102
Sequence:       1 MRSGGTLFALIGSLMLLLLL MLQLHMRGILKLQFSKCETA 398

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       2849598 seqs, 925015592 residues

Total number of hits satisfying chosen parameters:      2849598

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :       UniProt_7.2:*
                 1:  uniprot_sprot:*
                 2:  uniprot_trembl:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                   %
Result           Query
   No.    Score  Match  Length  DB  ID             Description
     1     2102  100.0     398   1  SIA8F_MOUSE    Q8k4t1 mus musculu
     2     2102  100.0     405   2  Q3T9N4_MOUSE   Q3t9n4 mus musculu
     3     1936   92.1     398   2  Q6ZXC7_RAT     Q6zxc7 rattus norv
     4     1922   91.4     362   2  Q8BI43_MOUSE   Q8bi43 mus musculu
     5     1772   84.3     398   1  SIA8F_HUMAN    P61647 homo sapien
     6     1772   84.3     398   2  Q5VZH4_HUMAN   Q5vzh4 homo sapien
     7     1763   83.9     398   1  SIA8F_PANTR    P61648 pan troglod
     8     1597   76.0     398   2  Q5NDG0_BOVIN   Q5ndg0 bos taurus
     9     1183   56.3     398   2  Q6ZXC6_CHICK   Q6zxc6 gallus gall
    10      728   34.6     343   2  Q6KC02_BRARE   Q6kc02 brachydanio
    11    727.5   34.6     379   2  Q6KC00_TETNG   Q6kc00 tetraodon n
    12    725.5   34.5     378   2  Q5K019_ORYLA   Q5k019 oryzias lat
    13    719.5   34.2     372   2  Q50J40_FUGRU   Q50j40 fugu rubrip
    14    719.5   34.2     379   2                 fugu rubrip
    15    715.5   34.0     356   2  Q6KBZ7_BRARE   Q6kbz7 brachydanio
    16    710.5   33.8     342   2  P79783_CHICK   P79783 gallus gall
    17      703   33.4     356   1  SIA8A_PANTR    P61642 pan troglod
    18      702   33.4     342   2  Q78EI9_9MURI   Q78ei9 rattus sp.
    19      701   33.3     356   2  P70554_RAT     P70554 rattus norv
    20      700   33.3     356   1  SIA8A_HUMAN    Q92185 homo sapien
    21      700   33.3     356   2  Q6ZXD2_BOVIN   Q6zxd2 bos taurus
    22      700   33.3     412   1  SIA8E_MOUSE    mus musculu
    23    699.5   33.3     370   2  Q6KBZ8_FUGRU   Q6kbz8 fugu rubrip
    24      697   33.2     355   2  Q8BL76_MOUSE   Q8bl76 mus musculu
    25      697   33.2     355   2  Q8K1C1_MOUSE   Q8k1c1 mus musculu
    26      692   32.9     376   2  Q3TRR3_MOUSE   Q3trr3 mus musculu
    27      691   32.9     355   2  Q8BWI0_MOUSE   Q8bwi0 mus musculu
    28      689   32.8     341   2  Q64468_MOUSE   Q64468 mus musculu
    29      688   32.7     376   2  Q8JZ03_MOUSE   Q8jz03 mus musculu
    30    687.5   32.7     359   2  Q6ZXA0_XENLA   Q6zxa0 xenopus lae
    31      686   32.6     374   2  Q5NDF9_BOVIN   Q5ndf9 bos taurus
    32      686   32.6     376   1  SIA8E_PANTR    pan troglod
    33      686   32.6     376   2  Q6IAW7_HUMAN   Q6iaw7 homo sapien
    34      686   32.6     376   2  Q6ZXC8_RAT     Q6zxc8 rattus norv
    35      685   32.6     419   2  Q4SUV1_TETNG   Q4suv1 tetraodon n
    36    682.5   32.5     338   2  Q6WRU1_XENLA   Q6wru1 xenopus lae
    37      682   32.4     355   1  SIA8A_MOUSE    Q64687 mus musculu
    38      678   32.3     376   2  Q6ZX98_CHICK   Q6zx98 gallus gall
    39    677.5   32.2     345   2  Q6DNH6_XENTR   Q6dnh6 xenopus tro
    40      677   32.2     376   1  SIA8E_HUMAN    O15466 homo sapien
    41    674.5   32.1     275   2  Q3V3B1_MOUSE   Q3v3b1 mus musculu
    42      670   31.9     369   2  Q6KBZ9_FUGRU   Q6kbz9 fugu rubrip
    43    665.5   31.7     268   2  Q50J35_BRARE   Q50j35 brachydanio
    44    665.5   31.7     298   2  Q4T001_TETNG   Q4t001 tetraodon n
    45    654.5   31.1     335   2  Q6KC12_TETNG   Q6kc12 tetraodon n



ALIGNMENTS 



RESULT 1
SIA8F_MOUSE
ID   SIA8F_MOUSE    STANDARD;      PRT;   398 AA.
AC   Q8K4T1;
DT   07-JUN-2004, integrated into UniProtKB/Swiss-Prot.
DT   01-OCT-2002, sequence version 1.
DT   07-FEB-2006, entry version 24.
DE   Alpha-2,8-sialyltransferase 8F (EC 2.4.99.-) (ST8Sia VI).
GN   Name=St8sia6; Synonyms=Siat8f;
OS   Mus musculus (Mouse).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi;
OC   Muroidea; Muridae; Murinae; Mus.
OX   NCBI_TaxID=10090;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [MRNA], AND CHARACTERIZATION.
RX   MEDLINE=22086185; PubMed=11980897; DOI=10.1074/jbc.M112367200;
RA   Takashima S., Ishida H.K., Inazu T., Ando T., Ishida H., Kiso M.,
RA   Tsuji S., Tsujimoto M.;
RT   "Molecular cloning and expression of a sixth type of alpha 2,8-
RT   sialyltransferase (ST8Sia VI) that sialylates O-glycans.";
RL   J. Biol. Chem. 277:24030-24038(2002).
CC   -!- FUNCTION: Prefers O-glycans to N-glycans or glycolipids as
CC       acceptor substrates. The minimal acceptor substrate is the NeuAc-
CC       alpha-2,3(6)-Gal sequence at the nonreducing end of their
CC       carbohydrate groups.
CC   -!- PATHWAY: Glycosylation.
CC   -!- SUBCELLULAR LOCATION: Golgi apparatus; Golgi membrane; single-pass
CC       type II membrane protein (Potential).
CC   -!- SIMILARITY: Belongs to the glycosyltransferase 29 family.
CC
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC
DR   EMBL; AB059554; BAC01265.1; -; mRNA.
DR   Ensembl; ENSMUSG00000003418; Mus musculus.
DR   MGI; MGI:2386797; st8sia6.
DR   GO; GO:0008373; F:sialyltransferase activity; IDA.
DR   GO; GO:0016051; P:carbohydrate biosynthesis; IDA.
DR   GO; GO:0009247; P:glycolipid biosynthesis; IDA.
DR   GO; GO:0006493; P:protein amino acid O-linked glycosylation; IDA.
DR   InterPro; IPR001675; Glyco_trans_29.
DR   InterPro; IPR012163; Sialyl_trans.
DR   Pfam; PF00777; Glyco_transf_29; 1.
DR   PIRSF; PIRSF005557; Sialyl_trans; 1.
KW   Glycoprotein; Glycosyltransferase; Golgi stack; Membrane;
KW   Signal-anchor; Transferase; Transmembrane.



FT   CHAIN         1    398       Alpha-2,8-sialyltransferase 8F.
FT                                /FTId=PRO_0000149300.
FT   TOPO_DOM      1      3       Cytoplasmic (Potential).
FT   TRANSMEM      4     24       Signal-anchor for type II membrane
FT                                protein (Potential).
FT   TOPO_DOM     25    398       Lumenal (Potential).
FT   CARBOHYD     66     66       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD     93     93       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    151    151       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    196    196       N-linked (GlcNAc...) (Potential).
FT   DISULFID    186    335       By similarity.
SQ   SEQUENCE   398 AA;  45428 MW;  A73A7A29640D4917 CRC64;

  Query Match             100.0%;  Score 2102;  DB 1;  Length 398;
  Best Local Similarity   100.0%;  Pred. No. 2.5e-156;
  Matches  398;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;

Qy      1 MRSGGTLFALIGSLMLLLLLRMLWCPADAPARSRLLMEGSREDTSGTSAALKTLWSPTTP 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 MRSGGTLFALIGSLMLLLLLRMLWCPADAPARSRLLMEGSREDTSGTSAALKTLWSPTTP 60

Qy     61 VPRTRNSTYLDEKTTQITEKCKDLQYSLNSLSNKTRRYSEDDYLQTITNIQRCPWNRQAE 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 VPRTRNSTYLDEKTTQITEKCKDLQYSLNSLSNKTRRYSEDDYLQTITNIQRCPWNRQAE 120

Qy    121 EYDNFRAKLASCCDAIQDFVVSQNNTPVGTNMSYEVESKKHIPIRENIFHMFPVSQPFVD 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 EYDNFRAKLASCCDAIQDFVVSQNNTPVGTNMSYEVESKKHIPIRENIFHMFPVSQPFVD 180

Qy    181 YPYNQCAVVGNGGILNKSLCGAEIDKSDFVFRCNLPPITGSASKDVGSKTNLVTVNPSII 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 YPYNQCAVVGNGGILNKSLCGAEIDKSDFVFRCNLPPITGSASKDVGSKTNLVTVNPSII 240

Qy    241 TLKYQNLKEKKAQFLEDISTYGDAFLLLPAFSYRANTGISFKVYQTLKESKMRQKVLFFH 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 TLKYQNLKEKKAQFLEDISTYGDAFLLLPAFSYRANTGISFKVYQTLKESKMRQKVLFFH 300

Qy    301 PRYLRHLALFWRTKGVTAYRLSTGLMIASVAVELCENVKLYGFWPFSKTIEDTPLSHHYY 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 PRYLRHLALFWRTKGVTAYRLSTGLMIASVAVELCENVKLYGFWPFSKTIEDTPLSHHYY 360

Qy    361 DNMLPKHGFHQMPKEYSQMLQLHMRGILKLQFSKCETA 398
          ||||||||||||||||||||||||||||||||||||||
Db    361 DNMLPKHGFHQMPKEYSQMLQLHMRGILKLQFSKCETA 398



RESULT 2 
Q3T9N4_MOUSE 

ID   Q3T9N4_MOUSE   PRELIMINARY;   PRT;   405 AA.
AC   Q3T9N4;
DT   11-OCT-2005, integrated into UniProtKB/TrEMBL.
DT   11-OCT-2005, sequence version 1.
DT   07-FEB-2006, entry version 5.
DE   Activated spleen cDNA, RIKEN full-length enriched library,
DE   clone:F830206C17 product:sialyltransferase 8 (alpha-2,8-
DE   sialyltransferase) F, full insert sequence.
GN   Name=st8sia6;
OS   Mus musculus (Mouse).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi;
OC   Muroidea; Muridae; Murinae; Mus.
OX   NCBI_TaxID=10090;
RN   [1]
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=NOD; TISSUE=Activated spleen;
RX   MEDLINE=99279253; PubMed=10349636; DOI=10.1016/S0076-6879(99)03004-9;
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning.";
RL   Methods Enzymol. 303:19-44(1999).
RN   [2]
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=NOD; TISSUE=Activated spleen;
RX   PubMed=16141072; DOI=10.1126/science.1112014;
RA   Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N.,
RA   Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K.,
RA   Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M.,
RA   Davis M.J., Wilming L.G., Aidinis V., Allen J.E.,
RA   Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L.,
RA   Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M.,
RA   Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R.,
RA   Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G.,
RA   di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G.,
RA   Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M.,
RA   Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E.,
RA   Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N.,
RA   Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T.,
RA   Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H.,
RA   Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K.,
RA   Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J.,
RA   Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L.,
RA Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., 

RA Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P.,

RA Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O.,

RA Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G.,

RA Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M.,

RA Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C.,

RA Schonbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y.,

RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B.,

RA Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K.,

RA Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., 

RA Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K.,

RA Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C.,

RA Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., 

RA Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y.,

RA Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T.,

RA Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N.,

RA Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N.,

RA Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S.,

RA Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J.,

RA Hayashizaki Y.; 

RT "The transcriptional landscape of the mammalian genome."; 

RL Science 309:1559-1563(2005). 

RN [3] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=NOD; TISSUE=Activated spleen; 

RX PubMed=16141073; DOI=10.1126/science.1112009;

RG RIKEN Genome Exploration Research Group, and Genome Science Group 

RG (Genome Network Core Team) and the FANTOM Consortium; 

RT "Antisense Transcription in the Mammalian Transcriptome."; 

RL Science 309:1564-1566(2005). 

RN [4] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=NOD; TISSUE=Activated spleen;

RX MEDLINE=22354683; PubMed=12466851; DOI=10.1038/nature01266;

RA Okazaki Y., Furuno M., Kasukawa T., Adachi J., Bono H., Kondo S.,

RA Nikaido I., Osato N., Saito R., Suzuki H., Yamanaka I., Kiyosawa H.,

RA Yagi K., Tomaru Y., Hasegawa Y., Nogami A., Schonbach C., Gojobori T.,

RA Baldarelli R., Hill D.P., Bult C., Hume D.A., Quackenbush J.,

RA Schriml L.M., Kanapin A., Matsuda H., Batalov S., Beisel K.W., 

RA Blake J.A., Bradt D., Brusic V., Chothia C., Corbani L.E., Cousins S.,

RA Dalla E., Dragani T.A., Fletcher C.F., Forrest A., Frazer K.S., 

RA Gaasterland T., Gariboldi M., Gissi C., Godzik A., Gough J.,

RA Grimmond S., Gustincich S., Hirokawa N., Jackson I.J., Jarvis E.D., 

RA Kanai A., Kawaji H., Kawasawa Y., Kedzierski R.M., King B.L.,

RA Konagaya A., Kurochkin I.V., Lee Y., Lenhard B., Lyons P.A.,

RA Maglott D.R., Maltais L., Marchionni L., McKenzie L., Miki H., 

RA Nagashima T., Numata K., Okido T., Pavan W.J., Pertea G., Pesole G.,

RA Petrovsky N., Pillai R., Pontius J.U., Qi D., Ramachandran S.,

RA Ravasi T., Reed J.C., Reed D.J., Reid J., Ring B.Z., Ringwald M., 

RA Sandelin A., Schneider C., Semple C.A., Setou M., Shimada K.,

RA Sultana R., Takenaka Y., Taylor M.S., Teasdale R.D., Tomita M.,

RA Verardo R., Wagner L., Wahlestedt C., Wang Y., Watanabe Y., Wells C.,

RA Wilming L.G., Wynshaw-Boris A., Yanagisawa M., Yang I., Yang L.,

RA Yuan Z., Zavolan M., Zhu Y., Zimmer A., Carninci P., Hayatsu N.,

RA Hirozane-Kishikawa T., Konno H., Nakamura M., Sakazume N., Sato K.,

RA Shiraki T., Waki K., Kawai J., Aizawa K., Arakawa T., Fukuda S.,

RA Hara A., Hashizume W., Imotani K., Ishii Y., Itoh M., Kagawa I.,

RA Miyazaki A., Sakai K., Sasaki D., Shibata K., Shinagawa A.,

RA Yasunishi A., Yoshino M., Waterston R., Lander E.S., Rogers J.,

RA Birney E., Hayashizaki Y.; 

RT "Analysis of the mouse transcriptome based on functional annotation of 
RT 60,770 full-length cDNAs.";

RL Nature 420:563-573(2002). 

RN [5] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=NOD; TISSUE=Activated spleen; 

RX MEDLINE=21085660; PubMed=11217851; DOI=10.1038/35055500;

RA Kawai J., Shinagawa A., Shibata K., Yoshino M., Itoh M., Ishii Y.,

RA Arakawa T., Hara A., Fukunishi Y., Konno H., Adachi J., Fukuda S., 

RA Aizawa K., Izawa M., Nishi K., Kiyosawa H., Kondo S., Yamanaka I.,

RA Saito T., Okazaki Y., Gojobori T., Bono H., Kasukawa T., Saito R.,

RA Kadota K., Matsuda H.A., Ashburner M., Batalov S., Casavant T.,

RA Fleischmann W., Gaasterland T., Gissi C., King B., Kochiwa H.,

RA Kuehl P., Lewis S., Matsuo Y., Nikaido I., Pesole G., Quackenbush J., 

RA Schriml L.M., Staubli F., Suzuki R., Tomita M., Wagner L., Washio T.,

RA Sakai K., Okido T., Furuno M., Aono H., Baldarelli R., Barsh G.,

RA Blake J., Boffelli D., Bojunga N., Carninci P., de Bonaldo M.F.,

RA Brownstein M.J., Bult C., Fletcher C., Fujita M., Gariboldi M.,

RA Gustincich S., Hill D., Hofmann M., Hume D.A., Kamiya M., Lee N.H.,

RA Lyons P., Marchionni L., Mashima J., Mazzarelli J., Mombaerts P., 

RA Nordone P., Ring B., Ringwald M., Rodriguez I., Sakamoto N., 

RA Sasaki H., Sato K., Schoenbach C., Seya T., Shibata Y., Storch K.-F.,

RA Suzuki H., Toyo-oka K., Wang K.H., Weitz C., Whittaker C., Wilming L.,

RA Wynshaw-Boris A., Yoshida K., Hasegawa Y., Kawaji H., Kohtsuki S., 

RA Hayashizaki Y.; 

RT "Functional annotation of a full-length mouse cDNA collection."; 

RL Nature 409:685-690(2001). 

RN [6] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=NOD; TISSUE=Activated spleen;

RX MEDLINE=20499374; PubMed=11042159; DOI=10.1101/gr.145100;

RA Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,

RA Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;

RT "Normalization and subtraction of cap-trapper-selected cDNAs to 

RT prepare full-length cDNA libraries for rapid discovery of new genes."; 

RL Genome Res. 10:1617-1630(2000). 

RN [7] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=NOD; TISSUE=Activated spleen;

RX MEDLINE=20530913; PubMed=11076861; DOI=10.1101/gr.152600;

RA Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,

RA Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M.,

RA Sumi N., Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A.,

RA Yamamoto R., Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K.,

RA Fujiwake S., Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M.,

RA Yoneda Y., Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., 

RA Okazaki Y., Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;

RT "RIKEN integrated sequence analysis (RISA) system-384-format

RT sequencing pipeline with 384 multicapillary sequencer."; 

RL Genome Res. 10:1757-1771(2000). 

RN [8] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=NOD; TISSUE=Activated spleen; 

RA Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., 

RA Hori F., Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S.,

RA Kawai J., Kojima M., Konno H., Murata M., Nakamura M., Ninomiya N., 

RA Nishiyori H., Nomura K., Ohno M., Sakazume N., Sano H., Sasaki D.,

RA Shibata K., Shiraki T., Tagami M., Tagami Y., Waki K., Watahiki A.,

RA Muramatsu M., Hayashizaki Y.; 

RL Submitted (APR-2004) to the EMBL/GenBank/DDBJ databases.

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 
CC Distributed under the Creative Commons Attribution-NoDerivs License

CC 

DR EMBL; AK172400; BAE42986.1; -; mRNA. 

DR MGI; MGI:2386797; st8sia6.

DR GO; GO:0008373; F:sialyltransferase activity; IDA.

DR GO; GO:0016051; P:carbohydrate biosynthesis; IDA.

DR GO; GO:0009247; P:glycolipid biosynthesis; IDA. 

DR GO; GO:0006493; P:protein amino acid O-linked glycosylation; IDA.

DR InterPro; IPR001675; Glyco_trans_29.

DR InterPro; IPR012163; Sialyl_trans.

DR Pfam; PF00777; Glyco_transf_29; 1.

DR PIRSF; PIRSF005557; Sialyl_trans; 1.

KW Glycosyltransferase; Transferase.

SQ SEQUENCE 405 AA; 46323 MW; 48B7A17BE3443487 CRC64; 



Query Match 100.0%; Score 2102; DB 2; Length 405; 

Best Local Similarity 100.0%; Pred. No. 2.5e-156; 

Matches 398; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy   1 MRSGGTLFALIGSLMLLLLLRMLWCPADAPARSRLLMEGSREDTSGTSAALKTLWSPTTP 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MRSGGTLFALIGSLMLLLLLRMLWCPADAPARSRLLMEGSREDTSGTSAALKTLWSPTTP 60

Qy  61 VPRTRNSTYLDEKTTQITEKCKDLQYSLNSLSNKTRRYSEDDYLQTITNIQRCPWNRQAE 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 VPRTRNSTYLDEKTTQITEKCKDLQYSLNSLSNKTRRYSEDDYLQTITNIQRCPWNRQAE 120

Qy 121 EYDNFRAKLASCCDAIQDFVVSQNNTPVGTNMSYEVESKKHIPIRENIFHMFPVSQPFVD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 EYDNFRAKLASCCDAIQDFVVSQNNTPVGTNMSYEVESKKHIPIRENIFHMFPVSQPFVD 180

Qy 181 YPYNQCAVVGNGGILNKSLCGAEIDKSDFVFRCNLPPITGSASKDVGSKTNLVTVNPSII 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 YPYNQCAVVGNGGILNKSLCGAEIDKSDFVFRCNLPPITGSASKDVGSKTNLVTVNPSII 240

Qy 241 TLKYQNLKEKKAQFLEDISTYGDAFLLLPAFSYRANTGISFKVYQTLKESKMRQKVLFFH 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 TLKYQNLKEKKAQFLEDISTYGDAFLLLPAFSYRANTGISFKVYQTLKESKMRQKVLFFH 300

Qy 301 PRYLRHLALFWRTKGVTAYRLSTGLMIASVAVELCENVKLYGFWPFSKTIEDTPLSHHYY 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 PRYLRHLALFWRTKGVTAYRLSTGLMIASVAVELCENVKLYGFWPFSKTIEDTPLSHHYY 360

Qy 361 DNMLPKHGFHQMPKEYSQMLQLHMRGILKLQFSKCETA 398
       ||||||||||||||||||||||||||||||||||||||
Db 361 DNMLPKHGFHQMPKEYSQMLQLHMRGILKLQFSKCETA 398



RESULT 3

Q6ZXC7_RAT

ID Q6ZXC7_RAT PRELIMINARY; PRT; 398 AA.

AC Q6ZXC7;

DT JUL-2004, integrated into UniProtKB/TrEMBL.

DT 05-JUL-2004, sequence version 1.

DT 07-FEB-, entry version 10.

DE Alpha-2,8-sialyltransferase.

GN Name=SIAT8F;

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi;

OC Muroidea; Muridae; Murinae; Rattus.